Overview
Brought to you by YData
Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 150000 |
| Missing cells | 33655 |
| Missing cells (%) | 2.0% |
| Duplicate rows | 351 |
| Duplicate rows (%) | 0.2% |
| Total size in memory | 13.7 MiB |
| Average record size in memory | 96.0 B |
Variable types
| Categorical | 1 |
|---|---|
| Numeric | 10 |
| Dataset has 351 (0.2%) duplicate rows | Duplicates |
SeriousDlqin2yrs is highly imbalanced (64.6%) | Imbalance |
MonthlyIncome has 29731 (19.8%) missing values | Missing |
NumberOfDependents has 3924 (2.6%) missing values | Missing |
RevolvingUtilizationOfUnsecuredLines is highly skewed (γ1 = 97.63157449) | Skewed |
NumberOfTime30-59DaysPastDueNotWorse is highly skewed (γ1 = 22.59710756) | Skewed |
DebtRatio is highly skewed (γ1 = 95.15779287) | Skewed |
MonthlyIncome is highly skewed (γ1 = 114.0403179) | Skewed |
NumberOfTimes90DaysLate is highly skewed (γ1 = 23.08734547) | Skewed |
NumberOfTime60-89DaysPastDueNotWorse is highly skewed (γ1 = 23.33174312) | Skewed |
RevolvingUtilizationOfUnsecuredLines has 10878 (7.3%) zeros | Zeros |
NumberOfTime30-59DaysPastDueNotWorse has 126018 (84.0%) zeros | Zeros |
DebtRatio has 4113 (2.7%) zeros | Zeros |
MonthlyIncome has 1634 (1.1%) zeros | Zeros |
NumberOfOpenCreditLinesAndLoans has 1888 (1.3%) zeros | Zeros |
NumberOfTimes90DaysLate has 141662 (94.4%) zeros | Zeros |
NumberRealEstateLoansOrLines has 56188 (37.5%) zeros | Zeros |
NumberOfTime60-89DaysPastDueNotWorse has 142396 (94.9%) zeros | Zeros |
NumberOfDependents has 86902 (57.9%) zeros | Zeros |
Reproduction
| Analysis started | 2025-04-25 10:35:23.267785 |
|---|---|
| Analysis finished | 2025-04-25 10:35:30.531620 |
| Duration | 7.26 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
SeriousDlqin2yrs
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| 0 | |
|---|---|
| 1 | 10026 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 139974 | |
| 1 | 10026 | 6.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 139974 | |
| 1 | 10026 | 6.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 139974 | |
| 1 | 10026 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 150000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 139974 | |
| 1 | 10026 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 150000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 139974 | |
| 1 | 10026 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 150000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 139974 | |
| 1 | 10026 | 6.7% |
RevolvingUtilizationOfUnsecuredLines
Real number (ℝ)
Skewed  Zeros 
| Distinct | 125728 |
|---|---|
| Distinct (%) | 83.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.0484381 |
| Minimum | 0 |
|---|---|
| Maximum | 50708 |
| Zeros | 10878 |
| Zeros (%) | 7.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.029867442 |
| median | 0.15418074 |
| Q3 | 0.55904625 |
| 95-th percentile | 0.9999999 |
| Maximum | 50708 |
| Range | 50708 |
| Interquartile range (IQR) | 0.52917881 |
Descriptive statistics
| Standard deviation | 249.75537 |
|---|---|
| Coefficient of variation (CV) | 41.29254 |
| Kurtosis | 14544.713 |
| Mean | 6.0484381 |
| Median Absolute Deviation (MAD) | 0.14832535 |
| Skewness | 97.631574 |
| Sum | 907265.71 |
| Variance | 62377.745 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 10878 | 7.3% |
| 0.9999999 | 10256 | 6.8% |
| 1 | 17 | < 0.1% |
| 0.9500998 | 8 | < 0.1% |
| 0.007984032 | 6 | < 0.1% |
| 0.954091816 | 6 | < 0.1% |
| 0.71314741 | 6 | < 0.1% |
| 0.796407186 | 5 | < 0.1% |
| 0.988023952 | 5 | < 0.1% |
| 0.994011976 | 5 | < 0.1% |
| Other values (125718) | 128808 |
| Value | Count | Frequency (%) |
| 0 | 10878 | |
| 8.37 × 10-6 | 1 | < 0.1% |
| 9.93 × 10-6 | 1 | < 0.1% |
| 1.25 × 10-5 | 1 | < 0.1% |
| 1.43 × 10-5 | 1 | < 0.1% |
| 1.49 × 10-5 | 1 | < 0.1% |
| 1.51 × 10-5 | 1 | < 0.1% |
| 1.6 × 10-5 | 1 | < 0.1% |
| 1.64 × 10-5 | 1 | < 0.1% |
| 1.87 × 10-5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 50708 | 1 | |
| 29110 | 1 | |
| 22198 | 1 | |
| 22000 | 1 | |
| 20514 | 1 | |
| 18300 | 1 | |
| 17441 | 1 | |
| 13930 | 1 | |
| 13498 | 1 | |
| 13400 | 1 |
age
Real number (ℝ)
| Distinct | 86 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52.295207 |
| Minimum | 0 |
|---|---|
| Maximum | 109 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 29 |
| Q1 | 41 |
| median | 52 |
| Q3 | 63 |
| 95-th percentile | 78 |
| Maximum | 109 |
| Range | 109 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 14.771866 |
|---|---|
| Coefficient of variation (CV) | 0.28247074 |
| Kurtosis | -0.49466883 |
| Mean | 52.295207 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 0.18899455 |
| Sum | 7844281 |
| Variance | 218.20802 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 49 | 3837 | 2.6% |
| 48 | 3806 | 2.5% |
| 50 | 3753 | 2.5% |
| 47 | 3719 | 2.5% |
| 63 | 3719 | 2.5% |
| 46 | 3714 | 2.5% |
| 53 | 3648 | 2.4% |
| 51 | 3627 | 2.4% |
| 52 | 3609 | 2.4% |
| 56 | 3589 | 2.4% |
| Other values (76) | 112979 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 21 | 183 | 0.1% |
| 22 | 434 | 0.3% |
| 23 | 641 | 0.4% |
| 24 | 816 | |
| 25 | 953 | |
| 26 | 1193 | |
| 27 | 1338 | |
| 28 | 1560 | |
| 29 | 1702 |
| Value | Count | Frequency (%) |
| 109 | 2 | < 0.1% |
| 107 | 1 | < 0.1% |
| 105 | 1 | < 0.1% |
| 103 | 3 | < 0.1% |
| 102 | 3 | < 0.1% |
| 101 | 3 | < 0.1% |
| 99 | 9 | |
| 98 | 6 | < 0.1% |
| 97 | 17 | |
| 96 | 18 |
NumberOfTime30-59DaysPastDueNotWorse
Real number (ℝ)
Skewed  Zeros 
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.42103333 |
| Minimum | 0 |
|---|---|
| Maximum | 98 |
| Zeros | 126018 |
| Zeros (%) | 84.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 98 |
| Range | 98 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.1927813 |
|---|---|
| Coefficient of variation (CV) | 9.9583119 |
| Kurtosis | 522.37654 |
| Mean | 0.42103333 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 22.597108 |
| Sum | 63155 |
| Variance | 17.579415 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 126018 | |
| 1 | 16033 | 10.7% |
| 2 | 4598 | 3.1% |
| 3 | 1754 | 1.2% |
| 4 | 747 | 0.5% |
| 5 | 342 | 0.2% |
| 98 | 264 | 0.2% |
| 6 | 140 | 0.1% |
| 7 | 54 | < 0.1% |
| 8 | 25 | < 0.1% |
| Other values (6) | 25 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 126018 | |
| 1 | 16033 | 10.7% |
| 2 | 4598 | 3.1% |
| 3 | 1754 | 1.2% |
| 4 | 747 | 0.5% |
| 5 | 342 | 0.2% |
| 6 | 140 | 0.1% |
| 7 | 54 | < 0.1% |
| 8 | 25 | < 0.1% |
| 9 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 98 | 264 | |
| 96 | 5 | < 0.1% |
| 13 | 1 | < 0.1% |
| 12 | 2 | < 0.1% |
| 11 | 1 | < 0.1% |
| 10 | 4 | < 0.1% |
| 9 | 12 | < 0.1% |
| 8 | 25 | < 0.1% |
| 7 | 54 | < 0.1% |
| 6 | 140 |
DebtRatio
Real number (ℝ)
Skewed  Zeros 
| Distinct | 114194 |
|---|---|
| Distinct (%) | 76.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 353.00508 |
| Minimum | 0 |
|---|---|
| Maximum | 329664 |
| Zeros | 4113 |
| Zeros (%) | 2.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.004329004 |
| Q1 | 0.17507383 |
| median | 0.36650784 |
| Q3 | 0.86825377 |
| 95-th percentile | 2449 |
| Maximum | 329664 |
| Range | 329664 |
| Interquartile range (IQR) | 0.69317994 |
Descriptive statistics
| Standard deviation | 2037.8185 |
|---|---|
| Coefficient of variation (CV) | 5.772774 |
| Kurtosis | 13734.289 |
| Mean | 353.00508 |
| Median Absolute Deviation (MAD) | 0.2457228 |
| Skewness | 95.157793 |
| Sum | 52950761 |
| Variance | 4152704.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4113 | 2.7% |
| 1 | 229 | 0.2% |
| 4 | 174 | 0.1% |
| 2 | 170 | 0.1% |
| 3 | 162 | 0.1% |
| 5 | 143 | 0.1% |
| 9 | 125 | 0.1% |
| 10 | 117 | 0.1% |
| 7 | 115 | 0.1% |
| 13 | 114 | 0.1% |
| Other values (114184) | 144538 |
| Value | Count | Frequency (%) |
| 0 | 4113 | |
| 2.6 × 10-5 | 1 | < 0.1% |
| 3.69 × 10-5 | 1 | < 0.1% |
| 3.93 × 10-5 | 1 | < 0.1% |
| 6.62 × 10-5 | 1 | < 0.1% |
| 7.5 × 10-5 | 1 | < 0.1% |
| 8 × 10-5 | 1 | < 0.1% |
| 8.57 × 10-5 | 1 | < 0.1% |
| 9.09 × 10-5 | 1 | < 0.1% |
| 9.15 × 10-5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 329664 | 1 | |
| 326442 | 1 | |
| 307001 | 1 | |
| 220516 | 1 | |
| 168835 | 1 | |
| 110952 | 1 | |
| 106885 | 1 | |
| 101320 | 1 | |
| 61907 | 1 | |
| 61106.5 | 1 |
MonthlyIncome
Real number (ℝ)
Missing  Skewed  Zeros 
| Distinct | 13594 |
|---|---|
| Distinct (%) | 11.3% |
| Missing | 29731 |
| Missing (%) | 19.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6670.2212 |
| Minimum | 0 |
|---|---|
| Maximum | 3008750 |
| Zeros | 1634 |
| Zeros (%) | 1.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1300 |
| Q1 | 3400 |
| median | 5400 |
| Q3 | 8249 |
| 95-th percentile | 14587.6 |
| Maximum | 3008750 |
| Range | 3008750 |
| Interquartile range (IQR) | 4849 |
Descriptive statistics
| Standard deviation | 14384.674 |
|---|---|
| Coefficient of variation (CV) | 2.1565513 |
| Kurtosis | 19504.705 |
| Mean | 6670.2212 |
| Median Absolute Deviation (MAD) | 2317 |
| Skewness | 114.04032 |
| Sum | 8.0222084 × 108 |
| Variance | 2.0691885 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5000 | 2757 | 1.8% |
| 4000 | 2106 | 1.4% |
| 6000 | 1934 | 1.3% |
| 3000 | 1758 | 1.2% |
| 0 | 1634 | 1.1% |
| 2500 | 1551 | 1.0% |
| 10000 | 1466 | 1.0% |
| 3500 | 1360 | 0.9% |
| 4500 | 1226 | 0.8% |
| 7000 | 1223 | 0.8% |
| Other values (13584) | 103254 | |
| (Missing) | 29731 | 19.8% |
| Value | Count | Frequency (%) |
| 0 | 1634 | |
| 1 | 605 | 0.4% |
| 2 | 6 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 2 | < 0.1% |
| 11 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3008750 | 1 | |
| 1794060 | 1 | |
| 1560100 | 1 | |
| 1072500 | 1 | |
| 835040 | 1 | |
| 730483 | 1 | |
| 702500 | 1 | |
| 699530 | 1 | |
| 649587 | 1 | |
| 629000 | 1 |
NumberOfOpenCreditLinesAndLoans
Real number (ℝ)
Zeros 
| Distinct | 58 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.45276 |
| Minimum | 0 |
|---|---|
| Maximum | 58 |
| Zeros | 1888 |
| Zeros (%) | 1.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| median | 8 |
| Q3 | 11 |
| 95-th percentile | 18 |
| Maximum | 58 |
| Range | 58 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 5.145951 |
|---|---|
| Coefficient of variation (CV) | 0.60878944 |
| Kurtosis | 3.0910667 |
| Mean | 8.45276 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.2153138 |
| Sum | 1267914 |
| Variance | 26.480812 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 13614 | 9.1% |
| 7 | 13245 | 8.8% |
| 5 | 12931 | 8.6% |
| 8 | 12562 | 8.4% |
| 4 | 11609 | 7.7% |
| 9 | 11355 | 7.6% |
| 10 | 9624 | 6.4% |
| 3 | 9058 | 6.0% |
| 11 | 8321 | 5.5% |
| 12 | 7005 | 4.7% |
| Other values (48) | 40676 |
| Value | Count | Frequency (%) |
| 0 | 1888 | 1.3% |
| 1 | 4438 | 3.0% |
| 2 | 6666 | |
| 3 | 9058 | |
| 4 | 11609 | |
| 5 | 12931 | |
| 6 | 13614 | |
| 7 | 13245 | |
| 8 | 12562 | |
| 9 | 11355 |
| Value | Count | Frequency (%) |
| 58 | 1 | < 0.1% |
| 57 | 2 | < 0.1% |
| 56 | 2 | < 0.1% |
| 54 | 4 | |
| 53 | 1 | < 0.1% |
| 52 | 3 | |
| 51 | 2 | < 0.1% |
| 50 | 2 | < 0.1% |
| 49 | 4 | |
| 48 | 6 |
NumberOfTimes90DaysLate
Real number (ℝ)
Skewed  Zeros 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.26597333 |
| Minimum | 0 |
|---|---|
| Maximum | 98 |
| Zeros | 141662 |
| Zeros (%) | 94.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 98 |
| Range | 98 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.1693038 |
|---|---|
| Coefficient of variation (CV) | 15.675646 |
| Kurtosis | 537.73894 |
| Mean | 0.26597333 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 23.087345 |
| Sum | 39896 |
| Variance | 17.383094 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 141662 | |
| 1 | 5243 | 3.5% |
| 2 | 1555 | 1.0% |
| 3 | 667 | 0.4% |
| 4 | 291 | 0.2% |
| 98 | 264 | 0.2% |
| 5 | 131 | 0.1% |
| 6 | 80 | 0.1% |
| 7 | 38 | < 0.1% |
| 8 | 21 | < 0.1% |
| Other values (9) | 48 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 141662 | |
| 1 | 5243 | 3.5% |
| 2 | 1555 | 1.0% |
| 3 | 667 | 0.4% |
| 4 | 291 | 0.2% |
| 5 | 131 | 0.1% |
| 6 | 80 | 0.1% |
| 7 | 38 | < 0.1% |
| 8 | 21 | < 0.1% |
| 9 | 19 | < 0.1% |
| Value | Count | Frequency (%) |
| 98 | 264 | |
| 96 | 5 | < 0.1% |
| 17 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 14 | 2 | < 0.1% |
| 13 | 4 | < 0.1% |
| 12 | 2 | < 0.1% |
| 11 | 5 | < 0.1% |
| 10 | 8 | < 0.1% |
| 9 | 19 | < 0.1% |
NumberRealEstateLoansOrLines
Real number (ℝ)
Zeros 
| Distinct | 28 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.01824 |
| Minimum | 0 |
|---|---|
| Maximum | 54 |
| Zeros | 56188 |
| Zeros (%) | 37.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 54 |
| Range | 54 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.129771 |
|---|---|
| Coefficient of variation (CV) | 1.1095331 |
| Kurtosis | 60.476808 |
| Mean | 1.01824 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.482484 |
| Sum | 152736 |
| Variance | 1.2763825 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 56188 | |
| 1 | 52338 | |
| 2 | 31522 | |
| 3 | 6300 | 4.2% |
| 4 | 2170 | 1.4% |
| 5 | 689 | 0.5% |
| 6 | 320 | 0.2% |
| 7 | 171 | 0.1% |
| 8 | 93 | 0.1% |
| 9 | 78 | 0.1% |
| Other values (18) | 131 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 56188 | |
| 1 | 52338 | |
| 2 | 31522 | |
| 3 | 6300 | 4.2% |
| 4 | 2170 | 1.4% |
| 5 | 689 | 0.5% |
| 6 | 320 | 0.2% |
| 7 | 171 | 0.1% |
| 8 | 93 | 0.1% |
| 9 | 78 | 0.1% |
| Value | Count | Frequency (%) |
| 54 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 26 | 1 | < 0.1% |
| 25 | 3 | |
| 23 | 2 | |
| 21 | 1 | < 0.1% |
| 20 | 2 | |
| 19 | 2 | |
| 18 | 2 |
NumberOfTime60-89DaysPastDueNotWorse
Real number (ℝ)
Skewed  Zeros 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.24038667 |
| Minimum | 0 |
|---|---|
| Maximum | 98 |
| Zeros | 142396 |
| Zeros (%) | 94.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 98 |
| Range | 98 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.1551794 |
|---|---|
| Coefficient of variation (CV) | 17.285399 |
| Kurtosis | 545.68274 |
| Mean | 0.24038667 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 23.331743 |
| Sum | 36058 |
| Variance | 17.265516 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 142396 | |
| 1 | 5731 | 3.8% |
| 2 | 1118 | 0.7% |
| 3 | 318 | 0.2% |
| 98 | 264 | 0.2% |
| 4 | 105 | 0.1% |
| 5 | 34 | < 0.1% |
| 6 | 16 | < 0.1% |
| 7 | 9 | < 0.1% |
| 96 | 5 | < 0.1% |
| Other values (3) | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 142396 | |
| 1 | 5731 | 3.8% |
| 2 | 1118 | 0.7% |
| 3 | 318 | 0.2% |
| 4 | 105 | 0.1% |
| 5 | 34 | < 0.1% |
| 6 | 16 | < 0.1% |
| 7 | 9 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 98 | 264 | |
| 96 | 5 | < 0.1% |
| 11 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 7 | 9 | < 0.1% |
| 6 | 16 | < 0.1% |
| 5 | 34 | < 0.1% |
| 4 | 105 | 0.1% |
| 3 | 318 |
NumberOfDependents
Real number (ℝ)
Missing  Zeros 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3924 |
| Missing (%) | 2.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.75722227 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 86902 |
| Zeros (%) | 57.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.1150861 |
|---|---|
| Coefficient of variation (CV) | 1.4726007 |
| Kurtosis | 3.0016568 |
| Mean | 0.75722227 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.5882424 |
| Sum | 110612 |
| Variance | 1.2434169 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 86902 | |
| 1 | 26316 | 17.5% |
| 2 | 19522 | 13.0% |
| 3 | 9483 | 6.3% |
| 4 | 2862 | 1.9% |
| 5 | 746 | 0.5% |
| 6 | 158 | 0.1% |
| 7 | 51 | < 0.1% |
| 8 | 24 | < 0.1% |
| 10 | 5 | < 0.1% |
| Other values (3) | 7 | < 0.1% |
| (Missing) | 3924 | 2.6% |
| Value | Count | Frequency (%) |
| 0 | 86902 | |
| 1 | 26316 | 17.5% |
| 2 | 19522 | 13.0% |
| 3 | 9483 | 6.3% |
| 4 | 2862 | 1.9% |
| 5 | 746 | 0.5% |
| 6 | 158 | 0.1% |
| 7 | 51 | < 0.1% |
| 8 | 24 | < 0.1% |
| 9 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 10 | 5 | < 0.1% |
| 9 | 5 | < 0.1% |
| 8 | 24 | < 0.1% |
| 7 | 51 | < 0.1% |
| 6 | 158 | 0.1% |
| 5 | 746 | 0.5% |
| 4 | 2862 | 1.9% |
| 3 | 9483 |
Interactions
Correlations
| DebtRatio | MonthlyIncome | NumberOfDependents | NumberOfOpenCreditLinesAndLoans | NumberOfTime30-59DaysPastDueNotWorse | NumberOfTime60-89DaysPastDueNotWorse | NumberOfTimes90DaysLate | NumberRealEstateLoansOrLines | RevolvingUtilizationOfUnsecuredLines | SeriousDlqin2yrs | age | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| DebtRatio | 1.000 | -0.131 | -0.038 | 0.227 | 0.038 | 0.001 | -0.032 | 0.400 | 0.077 | 0.000 | 0.029 |
| MonthlyIncome | -0.131 | 1.000 | 0.204 | 0.312 | -0.015 | -0.053 | -0.088 | 0.391 | -0.078 | 0.000 | 0.135 |
| NumberOfDependents | -0.038 | 0.204 | 1.000 | 0.100 | 0.071 | 0.035 | 0.030 | 0.166 | 0.118 | 0.031 | -0.228 |
| NumberOfOpenCreditLinesAndLoans | 0.227 | 0.312 | 0.100 | 1.000 | 0.064 | -0.048 | -0.135 | 0.473 | -0.087 | 0.049 | 0.158 |
| NumberOfTime30-59DaysPastDueNotWorse | 0.038 | -0.015 | 0.071 | 0.064 | 1.000 | 0.280 | 0.253 | 0.022 | 0.234 | 0.084 | -0.095 |
| NumberOfTime60-89DaysPastDueNotWorse | 0.001 | -0.053 | 0.035 | -0.048 | 0.280 | 1.000 | 0.321 | -0.044 | 0.188 | 0.082 | -0.085 |
| NumberOfTimes90DaysLate | -0.032 | -0.088 | 0.030 | -0.135 | 0.253 | 0.321 | 1.000 | -0.101 | 0.238 | 0.085 | -0.104 |
| NumberRealEstateLoansOrLines | 0.400 | 0.391 | 0.166 | 0.473 | 0.022 | -0.044 | -0.101 | 1.000 | -0.027 | 0.033 | 0.054 |
| RevolvingUtilizationOfUnsecuredLines | 0.077 | -0.078 | 0.118 | -0.087 | 0.234 | 0.188 | 0.238 | -0.027 | 1.000 | 0.000 | -0.278 |
| SeriousDlqin2yrs | 0.000 | 0.000 | 0.031 | 0.049 | 0.084 | 0.082 | 0.085 | 0.033 | 0.000 | 1.000 | 0.116 |
| age | 0.029 | 0.135 | -0.228 | 0.158 | -0.095 | -0.085 | -0.104 | 0.054 | -0.278 | 0.116 | 1.000 |
Missing values
Sample
| SeriousDlqin2yrs | RevolvingUtilizationOfUnsecuredLines | age | NumberOfTime30-59DaysPastDueNotWorse | DebtRatio | MonthlyIncome | NumberOfOpenCreditLinesAndLoans | NumberOfTimes90DaysLate | NumberRealEstateLoansOrLines | NumberOfTime60-89DaysPastDueNotWorse | NumberOfDependents | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | 0.766127 | 45 | 2 | 0.802982 | 9120.0 | 13 | 0 | 6 | 0 | 2.0 |
| 2 | 0 | 0.957151 | 40 | 0 | 0.121876 | 2600.0 | 4 | 0 | 0 | 0 | 1.0 |
| 3 | 0 | 0.658180 | 38 | 1 | 0.085113 | 3042.0 | 2 | 1 | 0 | 0 | 0.0 |
| 4 | 0 | 0.233810 | 30 | 0 | 0.036050 | 3300.0 | 5 | 0 | 0 | 0 | 0.0 |
| 5 | 0 | 0.907239 | 49 | 1 | 0.024926 | 63588.0 | 7 | 0 | 1 | 0 | 0.0 |
| 6 | 0 | 0.213179 | 74 | 0 | 0.375607 | 3500.0 | 3 | 0 | 1 | 0 | 1.0 |
| 7 | 0 | 0.305682 | 57 | 0 | 5710.000000 | NaN | 8 | 0 | 3 | 0 | 0.0 |
| 8 | 0 | 0.754464 | 39 | 0 | 0.209940 | 3500.0 | 8 | 0 | 0 | 0 | 0.0 |
| 9 | 0 | 0.116951 | 27 | 0 | 46.000000 | NaN | 2 | 0 | 0 | 0 | NaN |
| 10 | 0 | 0.189169 | 57 | 0 | 0.606291 | 23684.0 | 9 | 0 | 4 | 0 | 2.0 |
| SeriousDlqin2yrs | RevolvingUtilizationOfUnsecuredLines | age | NumberOfTime30-59DaysPastDueNotWorse | DebtRatio | MonthlyIncome | NumberOfOpenCreditLinesAndLoans | NumberOfTimes90DaysLate | NumberRealEstateLoansOrLines | NumberOfTime60-89DaysPastDueNotWorse | NumberOfDependents | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 149991 | 0 | 0.055518 | 46 | 0 | 0.609779 | 4335.0 | 7 | 0 | 1 | 0 | 2.0 |
| 149992 | 0 | 0.104112 | 59 | 0 | 0.477658 | 10316.0 | 10 | 0 | 2 | 0 | 0.0 |
| 149993 | 0 | 0.871976 | 50 | 0 | 4132.000000 | NaN | 11 | 0 | 1 | 0 | 3.0 |
| 149994 | 0 | 1.000000 | 22 | 0 | 0.000000 | 820.0 | 1 | 0 | 0 | 0 | 0.0 |
| 149995 | 0 | 0.385742 | 50 | 0 | 0.404293 | 3400.0 | 7 | 0 | 0 | 0 | 0.0 |
| 149996 | 0 | 0.040674 | 74 | 0 | 0.225131 | 2100.0 | 4 | 0 | 1 | 0 | 0.0 |
| 149997 | 0 | 0.299745 | 44 | 0 | 0.716562 | 5584.0 | 4 | 0 | 1 | 0 | 2.0 |
| 149998 | 0 | 0.246044 | 58 | 0 | 3870.000000 | NaN | 18 | 0 | 1 | 0 | 0.0 |
| 149999 | 0 | 0.000000 | 30 | 0 | 0.000000 | 5716.0 | 4 | 0 | 0 | 0 | 0.0 |
| 150000 | 0 | 0.850283 | 64 | 0 | 0.249908 | 8158.0 | 8 | 0 | 2 | 0 | 0.0 |
Duplicate rows
Most frequently occurring
| SeriousDlqin2yrs | RevolvingUtilizationOfUnsecuredLines | age | NumberOfTime30-59DaysPastDueNotWorse | DebtRatio | MonthlyIncome | NumberOfOpenCreditLinesAndLoans | NumberOfTimes90DaysLate | NumberRealEstateLoansOrLines | NumberOfTime60-89DaysPastDueNotWorse | NumberOfDependents | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 202 | 0 | 1.0 | 22 | 0 | 0.0 | 820.0 | 1 | 0 | 0 | 0 | 0.0 | 12 |
| 211 | 0 | 1.0 | 22 | 98 | 0.0 | NaN | 0 | 98 | 0 | 98 | 0.0 | 10 |
| 5 | 0 | 0.0 | 22 | 0 | 0.0 | 820.0 | 2 | 0 | 0 | 0 | 0.0 | 8 |
| 9 | 0 | 0.0 | 22 | 0 | 0.0 | NaN | 1 | 0 | 0 | 0 | 0.0 | 7 |
| 215 | 0 | 1.0 | 23 | 0 | 0.0 | 820.0 | 1 | 0 | 0 | 0 | 0.0 | 7 |
| 216 | 0 | 1.0 | 23 | 0 | 0.0 | NaN | 0 | 0 | 0 | 0 | 0.0 | 7 |
| 16 | 0 | 0.0 | 23 | 0 | 0.0 | NaN | 1 | 0 | 0 | 0 | 0.0 | 6 |
| 19 | 0 | 0.0 | 24 | 0 | 0.0 | 820.0 | 2 | 0 | 0 | 0 | 0.0 | 6 |
| 70 | 0 | 0.0 | 60 | 0 | 0.0 | NaN | 1 | 0 | 0 | 0 | 0.0 | 6 |
| 261 | 0 | 1.0 | 37 | 0 | 0.0 | NaN | 0 | 0 | 0 | 0 | 0.0 | 6 |